Estimating Joint Probabilities from Marginal Ones

نویسندگان

  • Tao Li
  • Shenghuo Zhu
  • Mitsunori Ogihara
  • Yinhe Cheng
چکیده

Estimating joint probabilities plays an important role in many data mining and machine learning tasks. In this paper we introduce two methods, minAB and prodAB, to estimate joint probabilities. Both methods are based on a light-weight structure, partition support . The core idea is to maintain the partition support of itemsets over logically disjoint partitions and then use it to estimate joint probabilities of itemsets of higher cardinalitiess. We present extensive mathematical analyses on both methods and compare their performances on synthetic datasets. We also demonstrate a case study of using the estimation methods in Apriori algorithm for fast association mining. Moreover, we explore the usefulness of the estimation methods in other mining/learning tasks [9]. Experimental results show the effectiveness of the estimation methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Second-order estimating equations for the analysis of clustered current status data.

With clustered event time data, interest most often lies in marginal features such as quantiles or probabilities from the marginal event time distribution or covariate effects on marginal hazard functions. Copula models offer a convenient framework for modeling. We present methods of estimating the baseline marginal distributions, covariate effects, and association parameters for clustered curr...

متن کامل

Independence and 2-Monotonicity: Nice to Have, Hard to Keep

When using lower probabilities to model uncertainty about the value assumed by a variable, 2-monotonicity is an interesting property to satisfy, as it greatly facilitates further treatments (such as the computation of lower/upper expectation bounds). In this paper, we show that multivariate joint models induced from marginal ones by strong independence, epistemic independence or epistemic irrel...

متن کامل

Estimating predicted probabilities from logistic regression: different methods correspond to different target populations.

BACKGROUND We review three common methods to estimate predicted probabilities following confounder-adjusted logistic regression: marginal standardization (predicted probabilities summed to a weighted average reflecting the confounder distribution in the target population); prediction at the modes (conditional predicted probabilities calculated by setting each confounder to its modal value); and...

متن کامل

Estimating Probability Distributions over Hypotheses with Variable Unification

We analyze the difficulties in applying Bayesian belief networks to language interpretation domains, which typically involve many unification hypotheses that posit variable bindings. As an alternative, we observe that the structure of the underlying hypothesis space permits an approximate encoding of the joint distribution based on marginal rather than conditional probabilities. This suggests a...

متن کامل

DoF Analysis of the K-user MISO Broadcast Channel with Alternating CSIT

We consider a K-user multiple-input single-output (MISO) broadcast channel (BC) where the channel state information (CSI) of user i(i = 1, 2, . . . ,K) may be either perfect (P), delayed (D) or not known (N) at the transmitter with probabilities λP , λ i D and λ i N , respectively. In this channel, according to the three possible CSIT for each user, joint CSIT of the K users could have at most ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002